122 research outputs found

Ensembling classical machine learning and deep learning approaches for morbidity identification from clinical notes

Author: Helaoui R.
Kumar V.
Reforgiato Recupero D.
Riboni D.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2021

The past decade has seen an explosion of the amount of digital information generated within the healthcare domain. Digital data exist in the form of images, video, speech, transcripts, electronic health records, clinical records, and free-text. Analysis and interpretation of healthcare data is a daunting task, and it demands a great deal of time, resources, and human effort. In this paper, we focus on the problem of co-morbidity recognition from patient’s clinical records. To this aim, we employ both classical machine learning and deep learning approaches. We use word embeddings and bag-of-words representations, coupled with feature selection techniques. The goal of our work is to develop a classification system to identify whether a certain health condition occurs for a patient by studying his/her past clinical records. In more detail, we have used pre-trained word2vec, domain-trained, GloVe, fastText, and universal sentence encoder embeddings to tackle the classification of sixteen morbidity conditions within clinical records. We have compared the outcomes of classical machine learning and deep learning approaches with the employed feature representation methods and feature selection methods. We present a comprehensive discussion of the performances and behaviour of the employed classical machine learning and deep learning approaches. Finally, we have also used ensemble learning techniques over a large number of combinations of classifiers to improve the single model performance. For our experiments, we used the n2c2 natural language processing research dataset, released by Harvard Medical School. The dataset is in the form of clinical notes that contain patient discharge summaries. Given the unbalancedness of the data and their small size, the experimental results indicate the advantage of the ensemble learning technique with respect to single classifier models. In particular, the ensemble learning technique has slightly improved the performances of single classification models but has greatly reduced the variance of predictions, stabilizing the accuracies (i.e., a lower standard deviation in comparison with single classifiers). In real-life scenarios, our work can be employed to identify with high accuracy morbidity conditions of patients by feeding our tool with their current clinical notes. Moreover, other domains where classification is a common problem might benefit from our approach as well.

Archivio istituzionale della ricerca - Università di Cagliari

TF-IDF vs word embeddings for morbidity identification in clinical notes: An initial study

Author: Dessi D.
Helaoui R.
Kumar V.
Reforgiato Recupero D.
Riboni D.
Publication venue: CEUR-WS
Publication date: 01/01/2020

Today, we are seeing an ever-increasing number of clinical notes that contain clinical results, images, and textual descriptions of patients' health state. All these data can be analyzed and employed to cater novel services that can help people and domain experts with their common healthcare tasks. However, many technologies such as Deep Learning and tools like Word Embeddings have started to be investigated only recently, and many challenges remain open when it comes to healthcare domain applications. To address these challenges, we propose the use of Deep Learning and Word Embeddings for identifying sixteen morbidity types within textual descriptions of clinical records. For this purpose, we have used a Deep Learning model based on Bidirectional Long Short-Term Memory (LSTM) layers which can exploit state-of-the-art vector representations of data such as Word Embeddings. We have employed pre-trained Word Embeddings, namely GloVe and Word2Vec, and our own Word Embeddings trained on the target domain. Furthermore, we have compared the performances of the deep learning approaches against the traditional tf-idf using Support Vector Machine and Multilayer Perceptron (our baselines). From the obtained results it seems that the latter outperform the combination of Deep Learning approaches using any word embeddings. Our preliminary results indicate that there are specific features that make the dataset biased in favour of traditional machine learning approaches.

Archivio istituzionale della ricerca - Università di Cagliari

A local feature engineering strategy to improve network anomaly detection

Author: Carta S.
Podda A. S.
Recupero D. R.
Saia R.
Publication venue: 'MDPI AG'
Publication date: 01/01/2020

The dramatic increase in devices and services that has characterized modern societies in recent decades, boosted by the exponential growth of ever faster network connections and the predominant use of wireless connection technologies, has materialized a very crucial challenge in terms of security. The anomaly-based intrusion detection systems, which for a long time have represented some of the most efficient solutions to detect intrusion attempts on a network, have to face this new and more complicated scenario. Well-known problems, such as the difficulty of distinguishing legitimate activities from illegitimate ones due to their similar characteristics and their high degree of heterogeneity, today have become even more complex, considering the increase in the network activity. After providing an extensive overview of the scenario under consideration, this work proposes a Local Feature Engineering (LFE) strategy aimed at addressing such problems through the adoption of a data preprocessing strategy that reduces the number of possible network event patterns, increasing at the same time their characterization. Unlike the canonical feature engineering approaches, which take into account the entire dataset, it operates locally in the feature space of each single event. The experiments conducted on real-world data showed that this strategy, which is based on the introduction of new features and the discretization of their values, improves the performance of the canonical state-of-the-art solutions.

Archivio istituzionale della ricerca - Università di Cagliari

A holistic auto-configurable ensemble machine learning strategy for financial trading

Author: Carta S.
Corriga A.
Ferreira A.
Recupero D. R.
Saia R.
Publication venue: 'MDPI AG'
Publication date: 01/01/2019

Financial markets forecasting represents a challenging task for a series of reasons, such as the irregularity, high fluctuation, and noise of the involved data, and the peculiar high unpredictability of the financial domain. Moreover, literature does not offer a proper methodology to systematically identify intrinsic and hyper-parameters, input features, and base algorithms of a forecasting strategy in order to automatically adapt itself to the chosen market. To tackle these issues, this paper introduces a fully automated optimized ensemble approach, where an optimized feature selection process has been combined with an automatic ensemble machine learning strategy, created by a set of classifiers with intrinsic and hyper-parameters learned in each market under consideration. A series of experiments performed on different real-world futures markets demonstrate the effectiveness of such an approach with regard both to the Buy and Hold baseline strategy and to several canonical state-of-the-art solutions.

Multidisciplinary Digital Publishing Institute

Archivio istituzionale della ricerca - Università di Cagliari

Popularity prediction of instagram posts

Author: Carta S.
Podda A. S.
Recupero D. R.
Saia R.
Usai G.
Publication venue: 'MDPI AG'
Publication date: 01/01/2020

Predicting the popularity of posts on social networks has taken on significant importance in recent years, and several social media management tools now offer solutions to improve and optimize the quality of published content and to enhance the attractiveness of companies and organizations. Scientific research has recently moved in this direction, with the aim of exploiting advanced techniques such as machine learning, deep learning, and natural language processing to support such tools. In light of the above, in this work we aim to address the challenge of predicting the popularity of a future post on Instagram, by defining the problem as a classification task and by proposing an original approach based on Gradient Boosting and feature engineering, which led us to promising experimental results. The proposed approach exploits big data technologies for scalability and efficiency, and it is general enough to be applied to other social media as well.

Archivio istituzionale della ricerca - Università di Cagliari

A blockchain-based distributed paradigm to secure localization services

Author: Fenu G.
Podda A. S.
Pompianu L.
Reforgiato Recupero D.
Saia R.
Publication venue: 'MDPI AG'
Publication date: 01/01/2021

In recent decades, modern societies have been experiencing an increasing adoption of interconnected smart devices. This revolution involves not only canonical devices such as smartphones and tablets, but also simple objects like light bulbs. Named the Internet of Things (IoT), this ever-growing scenario offers enormous opportunities in many areas of modern society, especially if joined by other emerging technologies such as the blockchain. Indeed, the latter allows users to certify transactions publicly, without relying on central authorities or intermediaries. This work aims to exploit the scenario above by proposing a novel blockchain-based distributed paradigm to secure localization services, here named the Internet of Entities (IoE). It represents a mechanism for the reliable localization of people and things, and it exploits the increasing number of existing wireless devices and blockchain-based distributed ledger technologies. Moreover, unlike most of the canonical localization approaches, it is strongly oriented towards the protection of the users’ privacy. Finally, its implementation requires minimal effort since it employs the existing infrastructures and devices, thus giving life to a new and wide data environment, exploitable in many domains, such as e-health, smart cities, and smart mobility.

Archivio istituzionale della ricerca - Università di Cagliari

Recommended from our members

Finding Web-Based Anxiety Interventions on the World Wide Web: A Scoping Review

Author: Andrews G
Chambless DL
Ellis L
Eysenbach G
Fox E
Hickie IB
Hofmann SG
Kaltenthaler E
Kenwright M
Kessler R
Kroenke K
Levene M
Marks IM
McCrone P
Mechanic D
Proudfoot J
Recupero PR
Schonfeld WH
Slegg G
Somers JM
Publication venue: 'JMIR Publications Inc.'
Publication date: 01/06/2016

BACKGROUND: One relatively new and increasingly popular approach of increasing access to treatment is Web-based intervention programs. The advantage of Web-based approaches is the accessibility, affordability, and anonymity of potentially evidence-based treatment. Despite much research evidence on the effectiveness of Web-based interventions for anxiety found in the literature, little is known about what is publically available for potential consumers on the Web. OBJECTIVE: Our aim was to explore what a consumer searching the Web for Web-based intervention options for anxiety-related issues might find. The objectives were to identify currently publically available Web-based intervention programs for anxiety and to synthesize and review these in terms of (1) website characteristics such as credibility and accessibility; (2) intervention program characteristics such as intervention focus, design, and presentation modes; (3) therapeutic elements employed; and (4) published evidence of efficacy. METHODS: Web keyword searches were carried out on three major search engines (Google, Bing, and Yahoo-UK platforms). For each search, the first 25 hyperlinks were screened for eligible programs. Included were programs that were designed for anxiety symptoms, currently publically accessible on the Web, had an online component, a structured treatment plan, and were available in English. Data were extracted for website characteristics, program characteristics, therapeutic characteristics, as well as empirical evidence. Programs were also evaluated using a 16-point rating tool. RESULTS: The search resulted in 34 programs that were eligible for review. A wide variety of programs for anxiety, including specific anxiety disorders, and anxiety in combination with stress, depression, or anger were identified and based predominantly on cognitive behavioral therapy techniques. The majority of websites were rated as credible, secure, and free of advertisement. 
The majority required users to register and/or to pay a program access fee. Half of the programs offered some form of paid therapist or professional support. Programs varied in treatment length and number of modules and employed a variety of presentation modes. Relatively few programs had published research evidence of the intervention's efficacy. CONCLUSIONS: This review represents a snapshot of available Web-based intervention programs for anxiety that could be found by consumers in March 2015. The consumer is confronted with a diversity of programs, which makes it difficult to identify an appropriate program. Limited reports and existence of empirical evidence for efficacy make it even more challenging to identify credible and reliable programs. This highlights the need for consistent guidelines and standards on developing, providing, and evaluating Web-based interventions and platforms with reliable up-to-date information for professionals and consumers about the characteristics, quality, and accessibility of Web-based interventions.

City Research Online

Crossref

PubMed Central

Protons in near earth orbit

Author: A. Arefiev
A. Biland
A. Chiarini
A. Contin
A. Cotta-Ramusino
A. Hasan
A. Klimentov
A. Lebedev
A. Margotti
A. Mihul
A. Mourao
A. Mujunen
A. Papi
A. Pesci
A. Pevsner
A. Schultz von Dratzig
A. Zichichi
Ahlen
B. Alpat
B. Bertucci
B. Meillon
B. Verlaat
B.C. Wang
C. Camps
C. Maña
C. Roissin
C. Williams
C.C. Feng
C.G. Yang
C.L. Liu
D. Alvisi
D. Barancourt
D. Casadei
D. Grandi
D. Luckey
D. Rapin
D. Ren
D. Santos
D. Son
D. Vité
D.X. Zhao
E. Babucci
E. Fiandrini
E. Perrin
E. Prati
E. Riihonen
E. Shoumilov
E. Valtonen
E. Velikhov
E.S. Seo
F. Barao
F. Cindolo
F. Finelli
F. Massera
F. Mayet
F. Mezzanotte
F. Palmonari
F. Pauss
F. Raupach
F. Velcea
F. Vezzu
F.J. Eppling
G. Ambrosi
G. Barbier
G. Barreira
G. Boella
G. Bruni
G. Castellini
G. Esposito
G. Fluegge
G. Kenney
G. Laborie
G. Lamanna
G. Laurenti
G. Levi
G. Lu
G. Molinari
G. Pancaldi
G. Sartorelli
G. Schwering
G. Torromeo
G. Viertel
G.S. Sun
G.Y. Zhu
H. Anderhub
H. Hofer
H. Postema
H. Suter
H. Von Gunten
H.F. Chen
H.L. Zhuang
H.S. Chen
H.T. Liu
H.Y. Zhang
Hart
I. D'Antone
I. Lopes
I. Usoskin
I. Vetlitsky
I.H. Park
J. Alcaraz
J. Berdugo
J. Casaus
J. Engelberg
J. Favier
J. Kenny
J. Ritakari
J. Torsti
J. Trümper
J. Ulbricht
J. Vandenhirtz
J.D. Burger
J.D. Deus
J.L. Yan
J.P. da Cunha
J.P. Richeux
J.P. Vialle
J.Z. Wang
K. Hangarter
K. Karlamaa
K. Lübelsmeyer
K. Wiik
L. Ao
L. Baldini
L. Bellagamba
L. Djambazov
L.G. Yan
M. Basile
M. Boschini
M. Bourquin
M. Buenerd
M. Capell
M. Cristinziani
M. Gervasi
M. Ionica
M. Jongmanns
M. Lolli
M. Menichelli
M. Pauluzzi
M. Pimenta
M. Ribordy
M. Steuer
M. Tornikoski
M. Yang
M.A. Huang
N. Dinu
N. Fouque
N. Produit
N.A. Chernoplekov
P. Azzarello
P. Berges
P. Béné
P. Cannarsa
P. Crespo
P. Emonet
P. Extermann
P. Giusti
P. Levtchenko
P. Yeh
P.C. Xia
P.G. Rancoita
P.H. Fisher
R. Battiston
R. Becker
R. Cavalletti
R. Flaminio
R. Ionica
R. Kossakowski
R. Mezzenga
R. Pilastrini
R. Sagdeev
R. Siedling
R.R. McNeil
S. Bizzaglia
S. Blasko
S. Recupero
S. Urpo
S. Waldmeier Wicki
S.C. Lee
S.M. Ting
S.W. Ye
S.X. Wu
Samuel C.C. Ting
T. Eronen
T. Laitinen
T. Song
T.H. Chiueh
T.S. Dai
U. Becker
U. Roeser
V. Commichau
V. Hermel
V. Koutsenko
V. Plyaskin
V. Pojidaev
V. Postolache
V. Shoutko
W. Hungerford
W. Karpinski
W. Kim
W. Lustermann
W. Wallraff
W.J. Burger
W.Q. Gu
W.Z. Zhu
X.D. Cai
X.W. Tang
Y.H. Chang
Y.H. Wang
Y.L. Chuang
Y.S. Lu
Yu. Galaktionov
Z. Ren
Z.G. Chen
Z.P. Zhang
Z.R. Dong
Z.Z. Xu
Publication venue: 'Elsevier BV'
Publication date: 01/01/2000

The proton spectrum in the kinetic energy range 0.1 to 200 GeV was measured by the Alpha Magnetic Spectrometer (AMS) during space shuttle flight STS-91 at an altitude of 380 km. Above the geomagnetic cutoff the observed spectrum is parameterized by a power law. Below the geomagnetic cutoff a substantial second spectrum was observed concentrated at equatorial latitudes with a flux ~ 70 m^-2 sec^-1 sr^-1. Most of these second spectrum protons follow a complicated trajectory and originate from a restricted geographic region.

arXiv.org e-Print Archive

Crossref

HAL-IN2P3

Hal - Université Grenoble Alpes

UCL Discovery

Publikationsserver der RWTH Aachen University

HAL Université de Savoie

CERN Document Server

Search for antihelium in cosmic rays

Author: A. Arefiev
A. Biland
A. Chiarini
A. Contin
A. Cotta-Ramusino
A. Hasan
A. Klimentov
A. Lebedev
A. Margotti
A. Mihul
A. Mourao
A. Mujunen
A. Papi
A. Pesci
A. Pevsner
A. Schultz von Dratzig
A. Zichichi
Ahlen
B. Alpat
B. Bertucci
B. Meillon
B. Verlaat
B.C. Wang
Badhwar
Buffington
C. Camps
C. Maña
C. Roissin
C. Williams
C.C. Feng
C.G. Yang
C.L. Liu
D. Alvisi
D. Barancourt
D. Casadei
D. Luckey
D. Rapin
D. Ren
D. Santos
D. Son
D. Vité
D.X. Zhao
E. Babucci
E. Fiandrini
E. Perrin
E. Prati
E. Riihonen
E. Shoumilov
E. Valtonen
E. Velikhov
E.A. Werner
F. Barao
F. Cindolo
F. Finelli
F. Massera
F. Mayet
F. Mezzanotte
F. Palmonari
F. Pauss
F. Raupach
F. Tenbusch
F. Vezzu
F.J. Eppling
G. Ambrosi
G. Barbier
G. Barreira
G. Boella
G. Bruni
G. Castellini
G. Esposito
G. Fluegge
G. Kenney
G. Laborie
G. Lamanna
G. Laurenti
G. Levi
G. Lu
G. Maehlum
G. Molinari
G. Pancaldi
G. Sartorelli
G. Schwering
G. Torromeo
G. Viertel
G.S. Sun
G.Y. Zhu
Golden
H. Anderhub
H. Hofer
H. Postema
H. Suter
H. Von Gunten
H.L. Zhuang
H.S. Chen
H.T. Liu
H.Y. Zhang
I. D'Antone
I. Lopes
I. Usoskin
I. Vetlitsky
I.H. Park
J. Alcaraz
J. Berdugo
J. Casaus
J. Favier
J. Isbert
J. Kenny
J. Krieger
J. Ritakari
J. Torsti
J. Trümper
J. Ulbricht
J. Vandenhirtz
J.D. Burger
J.D. Deus
J.L. Yan
J.P. da Cunha
J.P. Richeux
J.P. Vialle
J.P. Wefel
J.Z. Wang
K. Hangarter
K. Lübelsmeyer
L. Ao
L. Baldini
L. Bellagamba
L. Djambazov
L.G. Yan
L.K. Ding
M. Basile
M. Bourquin
M. Buenerd
M. Capell
M. Cristinziani
M. Gervasi
M. Ionica
M. Jongmanns
M. Lolli
M. Menichelli
M. Pauluzzi
M. Pimenta
M. Ribordy
M. Steuer
M. Yang
M.A. Huang
Moiseev
N. Dinu
N. Fouque
N. Produit
N.A. Chernoplekov
Ormes
P. Azzarello
P. Berges
P. Béné
P. Cannarsa
P. Crespo
P. Emonet
P. Extermann
P. Giusti
P. Levtchenko
P. Yeh
P.C. Xia
P.G. Rancoita
P.H. Fisher
R. Battiston
R. Becker
R. Cavalletti
R. Flaminio
R. Ionica
R. Kossakowski
R. Mezzenga
R. Pilastrini
R. Sagdeev
R. Siedling
R.R. McNeil
S. Bizzaglia
S. Blasko
S. Recupero
S. Urpo
S. Waldmeier Wicki
S.C. Lee
S.M. Ting
S.X. Wu
Saeki
Samuel C.C. Ting
Smoot
Steigman
T. Eronen
T. Laitinen
T. Song
T.G. Guzik
T.H. Chiueh
T.P. Li
T.S. Dai
U. Becker
U. Roeser
V. Commichau
V. Hermel
V. Koutsenko
V. Plyaskin
V. Pojidaev
V. Shoutko
W. Hungerford
W. Karpinski
W. Kim
W. Lustermann
W. Wallraff
W.J. Burger
W.Q. Gu
W.Z. Zhu
X.D. Cai
X.W. Tang
Y.H. Chang
Y.H. Wang
Y.L. Chuang
Y.S. Lu
Yu. Galaktionov
Z. Ren
Z.G. Chen
Z.R. Dong
Publication venue: 'Elsevier BV'
Publication date: 01/01/1999

The Alpha Magnetic Spectrometer (AMS) was flown on the space shuttle Discovery during flight STS-91 in a 51.7 degree orbit at altitudes between 320 and 390 km. A total of 2.86 * 10^6 helium nuclei were observed in the rigidity range 1 to 140 GV. No antihelium nuclei were detected at any rigidity. An upper limit on the flux ratio of antihelium to helium of < 1.1 * 10^-6 is obtained.

arXiv.org e-Print Archive

Crossref

HAL-IN2P3

Hal - Université Grenoble Alpes

Publikationsserver der RWTH Aachen University

HAL Université de Savoie

CERN Document Server